Background: It has recently become possible to collect next-generation DNA sequencing data sets composed of multiple samples from multiple biological units, where each sample may come from a single cell or from bulk tissue. However, no tool yet exists for simulating DNA sequencing data from such a nested sampling arrangement with single-cell and bulk samples, which developers of analysis methods need in order to assess accuracy and precision.
Results: We have developed a tool that simulates DNA sequencing data from hierarchically grouped (correlated) samples, where each sample is designated bulk or single-cell. Our tool uses a simple configuration file to define the experimental arrangement and can be integrated into software pipelines for testing variant callers or other genomic tools.
Conclusions: The DNA sequencing data generated by our simulator is representative of real data and integrates seamlessly with standard downstream analysis tools.